Proteomic analyses do not reveal subclinical inflammation in fatigued patients with clinically quiescent inflammatory bowel disease

Fatigue is a common and clinically challenging symptom in patients with inflammatory bowel diseases (IBD), occurring in ~ 50% of patients with quiescent disease. In this study, we aimed to investigate whether fatigue in patients with clinically quiescent IBD is reflected by circulating inflammatory proteins, which might reflect ongoing subclinical inflammation. Ninety-two (92) different inflammation-related proteins were measured in plasma of 350 patients with clinically quiescent IBD. Quiescent IBD was defined as clinical (Harvey-Bradshaw Index < 5 or Simple Clinical Colitis Activity Index < 2.5) and biochemical remission (C-reactive protein < 5 mg/L and absence of anemia) at time of fatigue assessment. Leukemia inhibitory factor receptor (LIF-R) concentrations were inversely associated with severe fatigue, also after adjustment for confounding factors (nominal P < 0.05). Although solely LIF-R showed weak ability to discriminate between mild and severe fatigue (area under the curve [AUC] = 0.61, 95%CI: 0.53–0.69, P < 0.05), a combined set of the top seven (7) fatigue-associated proteins (all P < 0.10) was observed to have reasonable discriminative performance (AUC = 0.82 [95%CI: 0.74–0.91], P < 0.01). Fatigue in patients with IBD is not clearly reflected by distinct protein signatures, suggesting there is no subclinical inflammation defined by the studied inflammatory proteins. Future studies are warranted to investigate other proteomic markers that may reflect fatigue in clinically quiescent IBD.


Results
Cohort characteristics. In total, 350 patients were included, of which 188 patients were diagnosed with CD and 162 patients with UC. Patients with the lowest fatigue scores (belonging to the lowest 25% or 1st quartile, Q1) and highest fatigue scores (belonging to the top 25% or 4th quartile, Q4) were compared for their demographic and clinical characteristics ( Table 1). The proportion of patients with CD among those reporting the highest fatigue scores (Q4) was higher compared to patients with low fatigue scores (Q1) (63.6% vs. 50.0%, P = 0.06), while the proportion of patients with UC was correspondingly lower (36.4% vs. 50.0%). The proportion of females was higher among fatigued patients (P < 0.01). Age and body-mass index (BMI) distributions were similar between lowest (Q1) and highest (Q4) fatigued patients (P = 0.95 and P = 0.16, respectively). Patients with highest fatigue scores smoked more often compared to patients with low fatigue scores (P < 0.01). No differences were observed between patients having the lowest (Q1) vs. highest (Q4) fatigue scores with regard to the use of thiopurines (P = 0.90), TNF-α-antagonists (P = 0.25) or other types of IBD medications (Table 1). A comparison of cohort characteristics of the full study population, comparing patients with below-median and above-median (Q1-2 vs. Q3-4) fatigue scores can be found in Supplementary Table S1.

Associations of plasma protein concentrations and fatigue scores in patients with IBD.
Plasma protein concentrations were compared between patients with lowest fatigue scores (Q1, ranging from 0 to 3, n = 96) and patients with the highest fatigue scores (Q4, ranging from 6 to 10, n = 99) (Supplementary Table S2). None of the quantified plasma proteins concentrations were found to be differentially abundant after adjusting for multiple comparisons (Benjamini-Hochberg method, all FDR > 0.1). However, six (6) plasma proteins were observed to be nominally significant (P < 0.05). Among these proteins, leukemia inhibitory factor receptor (LIF-R), Delta and Notch-like epidermal growth factor-related receptor (DNER), glial cell line-derived neurotrophic factor (GDNF), and C-X-C motif chemokine ligand 10 (CXCL10) were lower in fatigued (Q4) patients, whereas concentrations of vascular endothelial growth factor A (VEGF-A) and T-cell surface glycoprotein CD5 (CD5) were found to be relatively higher in fatigued (Q4) patients (Figs. 2 and 3). The same analyses were repeated in patients with below-median (ranging from 0 to 4, n = 176) compared to above-median (ranging from 5 to 10, n = 174) fatigue scores (Supplementary Table S3), which showed fairly similar results.
After subdividing the cohort into patients with CD and UC and creating CD-and UC-specific quartiles of fatigue scores, plasma proteins were compared again between patients with Q1 (range: 0-3 for both CD and UC) and Q4 (CD: 7-10; UC: 6-10) fatigue scores (Supplementary Tables S4 and S5). Similar to the full cohort analysis, no plasma proteins were observed to be differentially abundant after adjustment for multiple comparisons (FDR > 0.1). Among patients with CD, plasma concentrations of extracellular newly identified receptor for advanced glycation endproducts binding protein (EN-RAGE, also known as S100A12 or calgranulin C) and GDNF were differentially abundant under nominal significance (P < 0.05), while among patients with UC, neurturin (NRTN), adenosine deaminase (ADA) and fibroblast growth factor (FGF-23) were nominally significant (P < 0.05) ( Supplementary Fig. S1). Table 1. Demographic and clinical characteristics of the study population compared between mildly fatigued (Q1) and severely (Q4) fatigued patients. Data are presented as proportions n with corresponding percentages (%) or as median [interquartile range, IQR] in case of continuous variables. P-values ≤ 0.05 were considered statistically significant. BMI body-mass index, CD Crohn's disease, IBD inflammatory bowel disease, TNF-α tumor necrosis factor alpha, UC ulcerative colitis. *Numericized lower limits of detection (< 5 mg/L) were included in calculating median and IQR, but may falsely represent the (unknown) true biological value. All CRP values were < 5 mg/L, as this was one of the study's inclusion criteria. † The use of TNF-α-antagonists included use of the following compounds: infliximab, adalimumab, golimumab and certolizumab pegol. www.nature.com/scientificreports/ Amongst the nominally significant proteins in the full cohort analysis, only GDNF was associated with the psychological well-being score (Spearman's ⍴ = 0.16, P < 0.05), while the remaining proteins showed weak, nonsignificant correlations ( Supplementary Fig. S2).
Leukemia inhibitory factor receptor (LIF-R) is most strongly associated with fatigue. Subsequently, logistic regression analyses were performed to evaluate associations between plasma proteins and the presence of fatigue in patients with IBD, while having the possibility to control for potential confounding factors. In univariable logistic regression analysis, proteins were identified that were associated with the presence of fatigue while adopting a pre-selection threshold of P < 0.10 ( Table 2). Multivariable analysis revealed gender and smoking behavior as relevant confounding factors, and when adjusted for these factors, an inverse association between plasma LIF-R concentrations and the presence of severe fatigue (P < 0.05) was observed (Q4 vs. Q1). All remaining proteins from univariable analysis were not significantly associated with fatigue after adjustment for confounding factors.  www.nature.com/scientificreports/ To evaluate the discriminative capacity of plasma proteins that were incorporated into logistic regression analyses, receiver operating characteristics (ROC) statistics were calculated with the corresponding area under the curve (AUC) as overall measure of fit (Table 3). Individual plasma proteins demonstrated almost no discriminative performance with regard to the presence of severe (Q4) fatigue compared with mild (Q1) fatigue. Among these proteins, the LIF-R protein demonstrated the highest, but still very weak discriminative value (AUC 0.61  Table 2. Univariable and multivariable logistic regression analyses for associations between plasma proteins and the presence or absence of fatigue in patients with IBD (defined as by median or by the lowest (Q1) versus highest quartile (Q4) of fatigue scores). Data are presented as odds ratios (ORs) with corresponding 95% confidence intervals (CI) and P-values. Bold P-values indicate nominal P-values < 0.05 in multivariable analysis. OR odds ratio, CI confidence interval, aOR adjusted odds ratio. *Significant confounding variables included in multivariable analysis were gender (male/female) and current smoking (no/yes). Combinations of plasma proteins demonstrate reasonable discriminative performance. Next, we aimed to evaluate the combined predictive value of all seven (7) plasma proteins that were incorporated into logistic regression analyses (P < 0.10) and subsequent ROC analyses (Tables 2 and 3) with regard to the presence of fatigue (Fig. 4) Fig. 4C).

Discussion
This study demonstrates that inflammatory proteins do not strongly associate with fatigue in patients with clinically quiescent IBD. Although we failed to detect differentially abundant plasma proteins between fatigued and non-fatigued patients, some potentially relevant proteomic signals were observed, albeit characterized by weak associations and only nominal statistical significance. Leukemia inhibitory factor receptor (LIF-R) was the top associated plasma protein in relation to fatigue scores, also after adjustment for gender and smoking as relevant confounders. Individual plasma proteins showed weak discriminative capacity between mildly (Q1) and severely  www.nature.com/scientificreports/ (Q4) fatigued patients, whereas a combined set of seven top associated proteins showed reasonable discriminative performance. Collectively, these findings imply that fatigue is not solely driven by overt subclinical inflammation and that individual inflammatory proteins do not serve as accurate biomarkers, which could aid in quantification of fatigue burden or expose novel avenues for therapeutic intervention. Fatigue is one of the most common and disabling symptoms of patients with IBD, and occurs in approximately half of patients with quiescent disease 6 . Since its multifactorial etiology remains elusive and its assessment rather subjective, there is an urgent need for fatigue biomarkers. Multiple studies have demonstrated that fatigue is associated with clinically active IBD and systemic inflammation 11,16,17 . In contrast, there are few examples of studies that focused on the potential role of a subclinical pro-inflammatory state in fatigued patients with clinically quiescent IBD. In a study that analysed immune parameters in relation to fatigue in patients with clinically quiescent IBD, systemic concentrations of pro-inflammatory cytokines IFN-γ, TNF-α, IL-12 as well as numbers of memory T-cells and neutrophils were higher among fatigued patients, whereas IL-6 and monocyte concentrations were lower 14 . Similarly, raised pro-inflammatory cytokine concentrations (IFN-γ, IL-6, IL-12, IL-17A) were reported in relation to fatigue in a pediatric IBD population 18 . In contrast, another study observed no significant differences in inflammatory markers (including CRP, calprotectin, IFN-γ, IL-6, IL-1β, TNF-α, IL-12, IL-17A) between fatigued and non-fatigued patients who were in deep remission, which is more in line with our findings 17 . Likewise, no associations were observed between CRP, IL-5, IL-8 and IL-12 concentrations and fatigue scores among 202 patients with IBD who were in clinical remission 19 . Furthermore, a recent prospective multiomics-based study that included proteomics data did not detect marked differences in inflammatory proteins when comparing fatigued and non-fatigued patients with clinically and endoscopically quiescent IBD 15 . Despite the clear observations that active IBD, as reflected by elevated concentrations of inflammatory markers, is significantly correlated with fatigue severity, fatigue in IBD is not necessarily solely driven by subclinical inflammation.
In the present study, LIF-R was the top associated plasma protein in relation to fatigue scores. LIF-R (CD118) is a receptor for leukemia inhibitory factor (LIF) and IL-6-like cytokines, and interacts with a high-affinity subunit, glycoprotein 130 (gp130), which act together in the oncostatin M (OSM) signaling pathway. LIF is a pleiotropic cytokine that affects proliferation, maturation and survival of a wide variety of body cells 20 . Heterodimerization of LIF-R or the OSM-specific receptor (OSM-R) with gp130 activates OSM, which is considered as an 'inflammatory amplifier' and drives intestinal inflammation in IBD (mainly by activation of JAK-STAT and PI3K-Akt pathways), leading to increased production of cytokines, chemokines and adhesion molecules 20,21 . OSM is also a marker for non-responsiveness to TNF-α-antagonists in patients with IBD 22 . Mucosal LIF-R expression has been shown to be decreased in biopsies of newly diagnosed IBD patients as well as in patients without endoscopic remission 21 . Therefore, it could be hypothesized that low circulating LIF-R concentrations, which may reflect subclinical inflammation, drive fatigue through modification of the pro-inflammatory response. This, however, should theoretically be accompanied by higher concentrations of LIF, which we were not able to investigate in the present study due to a very low detection rate (< 10%). LIF and LIF-R have also been associated with fatigue beyond IBD. For instance, LIF is known for its strong cachexia-inducing ability in animal models, whereas it is also associated with cancer-related fatigue and weight loss in humans 23,24 . In addition, decreased activity of LIF-R but increased LIF activity have been associated with neuro-inflammation 25 . LIF-R plays a crucial role in enhancing cellular survival of neural cells and confers anti-inflammatory effects via stimulation of other neuroprotective cytokines. Considering this, reduced shedding of the LIF-R protein might be related to fatigue in IBD through induction of pro-inflammatory phenotypes of T-lymphocytes, macrophages, or microglia.
In attempting to characterize an inflammatory protein signature for fatigue in patients with clinically quiescent IBD, we found a combination of LIF-R, VEGF-A, GDNF, IL-20RA, DNER, CD5 and EN-RAGE to have reasonable ability in discriminating between mild and severe fatigue.
Fatigued patients exhibited lower concentrations of glial-derived neurotrophic factor (GDNF) and Delta and Notch-like epidermal growth factor-related receptor (DNER). The observation of lower GDNF concentrations in fatigued patients may potentially reflect the disturbed intestinal barrier function as is characteristic for patients with IBD. In a murine colitis model, GDNF was shown to ameliorate intestinal epithelial barrier function through reducing epithelial permeability and inhibiting mucosal inflammation 26 . Similarly, GDNF concentrations were previously found to be decreased in the inflamed intestine of patients with IBD, where GDNF attenuated desmoglein 2 (DSG2)-associated impairment of intestinal barrier function 27 . Similar to GDNF, plasma DNER concentrations were decreased in severely fatigued patients compared with mildly fatigued patients. Previously, we demonstrated that DNER concentrations are strongly inversely associated with CRP concentrations 28 . DNER activates the Notch-1 signaling pathway, which is associated with improved mucosal barrier function 29 . As opposed to GDNF and DNER, plasma VEGF-A concentrations were increased in fatigued patients. VEGF-A is a well-known mediator in IBD by stimulating intestinal inflammation, angiogenesis and leukocyte adhesion 30 . Active IBD is characterized by increased VEGF-A concentrations in the blood and the inflamed intestinal mucosa 30 . Importantly, intestinal neovascularization in IBD occurs in a notoriously disorganized manner, and the newly formed blood vessels are highly permeable as is evident by associated edema 31 . This may lead to disruption of the intestinal endothelial barrier, which may also result in compromised epithelial barrier integrity via angiocrine communication 32 . Given the above considerations, one may speculate that these protein alterations in fatigued patients reflect impaired intestinal barrier function, which could in turn reflect subclinical inflammation and give rise to fatigue perception through gut-brain axis signaling.
Strengths of the present study include the extensive phenotypic characterization of the study cohort, together with an exact time matching (within 24 h) of sampling and fatigue assessment. Furthermore, we were able to select a large patient cohort only consisting of patients with clinical (HBI or SCCAI scores) and biochemical (CRP) remission in the absence of anemia. There are also several limitations to this study that warrant recognition. For example, no endoscopic assessments of disease activity or fecal calprotectin levels were available at time of sampling, which would have preferentially been used to confirm the quiescent or active state of the www.nature.com/scientificreports/ disease. Instead, we had to rely on clinical and serological assessment of disease activity, which necessitates cautious interpretation as some included patients might have had subclinical intestinal inflammation. Second, this study was of cross-sectional design and did not include follow-up data, which could have enabled us to investigate fatigue and protein concentration trajectories and thereby unravel the dynamics of the observed associations. Third, fatigue was assessed over a period of 24 h, whereas this is not an accurate representation of chronic fatigue that is experienced by patients with IBD having either quiescent or active disease. Likewise, psychological well-being was also assessed over a period of 24 h, which does not take into account other potentially existing co-morbidities such as mood disturbances, chronic anxiety, or the experience of stressful life-events. In addition, both scores were only assessed in a cohort of patients with IBD, but their distributions were not assessed in unaffected population controls. Furthermore, some information was missing that could potentially affect fatigue in these patients, including medical and psychological comorbidity relevant to fatigue, nutritional deficiencies, sleep quality, and more granular information on psychological well-being. Finally, no absolute protein quantification was performed as this would have been possible by using more traditional methods such as enzyme-linked immunosorbent assays (ELISAs). This limits the possibility to compare protein concentrations between each other or with concentrations reported in previous studies. Instead, relative protein quantification was achieved through PEA technology, which is accompanied by high sensitivity and high precision compared to other multiplex proteomics techniques.
Our results indicate that inflammation-related plasma proteins may not be the best proteomic or metabolic markers for defining biomarker signatures for fatigue in patients with clinically quiescent IBD. Instead, markers representing alternative pathophysiological mechanisms or biological systems may be more promising to further investigate. For example, a recently published explorative study found alterations in plasma lipid profiles reflecting fatigue in IBD, which were mainly characterized by disturbances within the arachidonic acid and sphingolipid pathways 33 . Another study has linked alterations in the gut microbiome and serum metabolome to persistent fatigue in patients with quiescent IBD, which supported the prevailing gut-brain axis hypothesis in fatigue pathophysiology 15 .
This study aimed to contribute to the need for defining biomarkers for fatigue, being an important and clinically challenging symptom in patients with IBD. Until date, only few studies focused on the pathophysiological mechanisms that may underlie fatigue in the context of IBD, whereas these efforts in elucidating mechanisms were rather limited to specific and well-characterized disease entities such as chronic fatigue syndrome (CFS) and fibromyalgia 15 . To that end, our results may also have implications for fatigue beyond IBD, as fatigue is also prevalent among various other autoimmune diseases, such as rheumatoid arthritis or multiple sclerosis 34,35 . Therefore, a broader search for biomarker signatures for fatigue merits further research in a variety of clinical contexts and even beyond pre-defined disease entities.
In conclusion, this study provides evidence that systemic inflammation, as reflected by circulating inflammatory proteins, may not be the primary driver of fatigue in patients with clinically quiescent IBD. Further, the LIF-R protein may potentially be involved in the pathophysiology of fatigue in IBD, which requires further validation. Future studies are warranted to investigate the potential of other proteins to quantify fatigue burden in patients with clinically quiescent IBD, preferably proteins that represent alternative pathophysiological pathways.

Study population.
Patients with an established diagnosis of IBD were included at the outpatient clinic of the University Medical Center Groningen (UMCG), Groningen, the Netherlands. Patients were included based on their participation in the 1000IBD project: a large, deeply phenotyped cohort consisting of over 1,000 patients with IBD living in the northern parts of the Netherlands 36 . Patients were enrolled in the 1000IBD project in the period from 2010 to 2019. In this study, patients were included when they were classified as being in clinical and biochemical remission at the time of their visit. Patients having anemia were excluded from the study. The study was approved by the Institutional Review Board (IRB) of the UMCG (registered as no. 08/338). All patients provided written informed consent for their participation in the study. The study has been performed in accordance with the principles of the Declaration of Helsinki (2013).

Data collection. Detailed information on demographic and clinical variables was registered for all patients,
including age, sex, body-mass index (BMI), smoking behavior, Montreal disease classification, medication usage, history of bowel surgery, and disease activity. All this information was assessed at the same time when plasma samples were collected for proteomic profiling. Clinical disease activity was assessed using the Harvey-Bradshaw Index (HBI) for patients with CD and the Simple Clinical Colitis Activity Index for patients with UC. Blood hemoglobin and C-reactive protein (CRP) concentrations were routinely measured as part of clinical care on the exact same date of plasma sampling.
Study outcomes and definitions. The primary study outcome was the subjective assessment of fatigue, which is part of the IBD-specific outpatient assessment in the UMCG and was evaluated by a single item during patients' visits to the outpatient clinic at the same date of plasma sampling: "How fatigued has the patient been during the last 24 h?" (in Dutch: "In hoeverre heeft u last gehad van vermoeidheid in de afgelopen 24 uur?") 8 Answers to this item were based upon a visual analogue 10-point Likert scale, ranging from a score of 1 (being not fatigued at all) to 10 (being severely fatigued) (Supplementary Fig. S4). The secondary study outcome was an assessment of psychological well-being within 24 h, which was evaluated by the following item: "How does the patient feel [during the last 24 h]?" (in Dutch: "Welk rapportcijfer geeft u uw algemeen welzijn over de afgelopen 24 uur?"), and rated in a similar manner based on a visual analogue 10-point Likert scale, ranging from 1 (very bad) to 10 (excellent). Clinical remission was defined as an HBI < 5 for CD or SCCAI < 2.5 for UC 37  www.nature.com/scientificreports/ cal remission was defined as serum C-reactive protein (CRP) concentration < 5 mg/L. Anemia was defined as a hemoglobin concentration < 8.5 mmol/L for men and < 7.5 mmol/L for women, based on the Dutch national reference ranges 38 .
Proteomic profiling. Plasma concentrations of 92 different inflammation-related proteins were quantified using proximity extension assay (PEA) technology using the ProSeek Multiplex Inflammation panel (Olink Pro-teomics®, Uppsala, Sweden) (see Supplementary Table S6 for a full list with names, abbreviations, detection rates  and UniProt IDs). Plasma samples were measured in the Olink® testing facility in Uppsala, Sweden, where 92 matched oligonucleotide-labelled antibody pairs (probes) were incubated with the samples and allowed to pairwise bind to the target proteins within the sample. Hybridization occurs when two probes of the same type are brought together in close proximity, which is followed by DNA polymerase extension. Finally, the corresponding DNA sequence is detected and amplified with real-time microfluidic quantitative polymerase chain reaction (qPCR) (Biomark HD Instrument, Fluidigm®, San Francisco, CA, USA) 39 . Plasma samples were randomized on experimental plates using a randomization algorithm, which ensured randomization by age, sex and IBD subtypes (CD or UC) in order to reduce technical variation. An inter-plate intensity normalization procedure was performed prior to analysis of the data, which uses the overall median of the experiment as normalization factor. Subsequently, protein data were normalized on a log2-scale, where values were derived from inverted Ct-values of the real-time qPCR, expressed as normalized protein expression ) showed a very low detection rate (< 10%) and were removed for further analyses across all samples. Furthermore, tumor necrosis factor alpha (TNF-α) values were removed, as the measurement of this protein was excessively perturbed by anti-TNF-α antibodies-bound TNF-α (e.g., infliximab or adalimumab). As a consequence, the Olink TNF-α assay used for this study (ref no. 95302) delivered suboptimal results, as it employed polyclonal antibodies against TNF-α that also detect its monomeric forms, resulting in the simultaneous detection of biologically inactive forms 40,41 . Finally, proteins with NPX values below the detection limit were scored as missing values, because their inclusion in the analysis did not alter the eventual results. After data processing, a total of 83 different proteins were available for analysis for a total of 350 patients with IBD (188 CD, 162 UC).

Statistical analysis.
Baseline characteristics of the study population were presented as means ± standard deviations (SD), medians with interquartile ranges (IQR) or as proportions n with corresponding percentages (%). Assessment of normality was performed by visual inspection of normal probability (Q-Q) plots, histograms and kernel density plots. Differences in demographic and clinical data were compared using independent sample t-tests and one-way analysis of variance (ANOVA), Mann-Whitney U-tests or Kruskal-Wallis tests, or chisquared tests, depending on the number of independent groups and type of variables. Patients were divided into quartiles (Q1-Q4) of fatigue scores (1-10), where plasma protein concentrations were compared using Mann-Whitney U-tests or Kruskal-Wallis tests. Univariable logistic regression analysis was performed to assess the association between plasma proteins and either above-/below-median fatigue scores (Q1-2 vs. Q3-4) or severe versus mild fatigue (Q1 vs. Q4). Multivariable backward logistic regression analysis was performed to adjust for relevant confounding variables, which were derived from univariable analysis (pre-selection threshold: P < 0.10). Receiver operating characteristics (ROC) statistics with the area under the curve (AUC) as overall measure of fit and corresponding 95% confidence intervals (CI) were used to assess the discriminative ability of plasma proteins with regard to the binary outcomes. ROC curves and AUCs were computed using the non-parametric, tiecorrected trapezoidal approximation method. Discriminative performance of the multivariable-adjusted models was determined by ROC estimation of the combined predicted probabilities from the models. In addition, fitted logistic regression models were internally validated using k-fold cross-validation (k = 10). In this procedure, the dataset was randomly divided into k equally sized folds, where each fold was then left out (10% of cases) while the model was fitted against the remaining k-1 folds (90% of cases, the 'training set') and predictions were obtained for the left-out part (the 'test set'). This procedure was repeated ten times, where AUCs from each fold were averaged and bootstrapped to achieve statistical inference, resulting in a cross-validated AUC (cv-AUC). Statistical analysis was performed using the Python programming language (v.3.8.5, Python Software Foundation, https:// www. python. org), using the pandas (v.1.2.3) and sklearn (v.0.24.1) modules and the SPSS Statistics software package (v.25.0) (SPSS Inc., Chicago, IL, USA). Data visualization was performed using seaborn (v.0.11.1) and matplotlib (v.3.4.1) packages in Python. P-values ≤ 0.05 were considered statistically significant.

Data availability
The datasets used and/or analysed during the current study are available from the corresponding authors on reasonable request. The data for the Groningen 1000IBD cohort can be requested at the European Genome-Phenome Archive data repository with the accession number: EGAS00001002702.